SVM-based classification and feature selection methods for the analysis of inflammatory bowel disease microbiome data
Authors
Abstract
Motivation: The human gut hosts one of the most densely populated microbial communities in the world. The interaction of microbes with human host cells is responsible for several disease conditions and is critical to human health. It is imperative to understand the relationships between these microbial communities within the human gut and their roles in disease.

Methods: In this study we analyze the microbial communities within the human gut and their role in inflammatory bowel disease (IBD). The bacterial communities were interrogated using length heterogeneity PCR (LH-PCR) fingerprinting of mucosal and luminal associated microbial communities during healthy and diseased states. We develop support vector machine (SVM) based classification and feature selection techniques to differentiate between healthy controls and patients suffering from IBD. Moreover, we develop site-specific classifiers to analyze community differences on the inner lining of the intestine (the mucosa) and in the fluid within the intestine (the lumen). We also determine differentially abundant features across the different samples.

Results: Using SVM-based classifiers with feature selection, we can distinguish the communities of healthy controls from those of disease-class patients. We also report differentially abundant features that exist between the different patient groups. The site-specific analysis provides an understanding of the microbial community differences between the lumen and mucosa of the healthy controls and patients suffering from IBD.
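The general idea of pairing an SVM classifier with feature selection can be sketched as follows. This is only an illustration under stated assumptions, not the authors' pipeline: the abstract does not specify the kernel, the selection criterion, or the data layout. The sketch assumes a linear SVM trained by batch subgradient descent, ranks features by absolute weight, and uses synthetic profiles as a stand-in for LH-PCR fingerprints. All function names and parameter values are hypothetical.

```python
import random

def make_profiles(n_per_class, n_features=6, informative=(0, 1), noise=0.3, seed=0):
    """Synthetic stand-in for standardized fingerprint profiles (NOT real LH-PCR data).
    Labels: +1 = disease, -1 = healthy control. Only `informative` features shift."""
    rng = random.Random(seed)
    X, y = [], []
    for label in (+1, -1):
        for _ in range(n_per_class):
            row = [rng.gauss(0.0, noise) for _ in range(n_features)]
            for k, j in enumerate(informative):
                # Alternate the shift direction so the features move in opposite ways.
                row[j] += label * (1.0 if k % 2 == 0 else -1.0)
            X.append(row)
            y.append(label)
    return X, y

def decision(w, b, x):
    """Signed score of the linear classifier for one sample."""
    return sum(wj * xj for wj, xj in zip(w, x)) + b

def train_linear_svm(X, y, lam=0.1, lr=0.1, epochs=300):
    """Batch subgradient descent on lam/2 * ||w||^2 + mean hinge loss."""
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        grad_w = [lam * wj for wj in w]
        grad_b = 0.0
        for xi, yi in zip(X, y):
            if yi * decision(w, b, xi) < 1.0:  # sample violates the margin
                for j in range(d):
                    grad_w[j] -= yi * xi[j] / n
                grad_b -= yi / n
        w = [wj - lr * gj for wj, gj in zip(w, grad_w)]
        b -= lr * grad_b
    return w, b

def select_features(w, k):
    """Indices of the k features with the largest absolute weight, sorted."""
    ranked = sorted(range(len(w)), key=lambda j: abs(w[j]), reverse=True)
    return sorted(ranked[:k])

def project(X, cols):
    """Keep only the selected feature columns."""
    return [[row[j] for j in cols] for row in X]

def accuracy(w, b, X, y):
    """Fraction of samples whose predicted sign matches the label."""
    hits = sum(1 for xi, yi in zip(X, y)
               if (1 if decision(w, b, xi) >= 0 else -1) == yi)
    return hits / len(y)

def run_demo():
    X_train, y_train = make_profiles(20, seed=0)
    X_test, y_test = make_profiles(20, seed=1)
    w_full, _ = train_linear_svm(X_train, y_train)      # train on all features
    selected = select_features(w_full, k=2)             # keep the top-weighted ones
    w, b = train_linear_svm(project(X_train, selected), y_train)
    return selected, accuracy(w, b, project(X_test, selected), y_test)

if __name__ == "__main__":
    print(run_demo())
```

Calling `run_demo()` returns the selected feature indices and the held-out accuracy of the classifier retrained on those features alone. Ranking by absolute weight is one simple choice for a linear model; a site-specific analysis like the one described above would repeat the same steps separately on mucosal and luminal samples.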
Similar resources
Feature Selection and Classification of Microarray Gene Expression Data of Ovarian Carcinoma Patients using Weighted Voting Support Vector Machine
DNA microarray gene expression data offer a wealth of information spanning thousands of variables (genes). Analysis of this information can reveal the genetic causes of disease and the differences between tumors. In this study we try to reduce the high-dimensional data with a statistical method to select high-impact genes as biomarkers and then classify ovarian tumors based on gene expression data of...
Mental Arithmetic Task Recognition Using Effective Connectivity and Hierarchical Feature Selection From EEG Signals
Introduction: Mental arithmetic analysis based on the electroencephalogram (EEG) signal for monitoring the state of the user's brain functioning can be helpful for understanding some psychological disorders, such as attention deficit hyperactivity disorder, autism spectrum disorder, or dyscalculia, in which there is difficulty in learning or understanding arithmetic. Most mental arithmetic recogni...
Developing a New Method in Object Based Classification to Updating Large Scale Maps with Emphasis on Building Feature
As cities expand, updating urban maps for urban planning is important, and its effectiveness depends on the accuracy of information extraction and change detection. Information extraction methods fall into two groups: pixel-based (PB) and object-based (OB). OB analysis has overcome the limitations of PB analysis (producing salt-and-pepper results and features with hole...
Classification of Right/Left Hand Motor Imagery by Effective Connectivity Based on Transfer Entropy in EEG Signal
Right- and left-hand motor imagery (MI) analysis based on the electroencephalogram (EEG) signal can directly link the central nervous system to a computer or a device. This study aims to identify a set of robust and nonlinear effective brain connectivity features quantified by transfer entropy (TE) to characterize the relationship between brain regions from EEG signals and create a hierarchi...
Feature Selection Using Multi Objective Genetic Algorithm with Support Vector Machine
Different approaches have been proposed for feature selection to obtain a suitable feature subset from among all features. These methods search the feature space for feature subsets that satisfy some criteria or optimize several objective functions. The objective functions are divided into two main groups: filter and wrapper methods. In filter methods, feature subsets are selected according to some measu...